Abstract

In this paper we present a steepest descent method with Armijo's rule for multicriteria
optimization in the Riemannian context. The well-definedness of the sequence generated by the
method is guaranteed. Under mild assumptions on the multicriteria function, we prove that each
accumulation point (if any exists) satisfies first-order necessary conditions for Pareto optimality.
Moreover, assuming quasi-convexity of the multicriteria function and nonnegative curvature of
the Riemannian manifold, we prove full convergence of the sequence to a Pareto critical point.
1 Introduction

Consider the following minimization problem

\[
\min F(p) \quad \text{s.t. } p \in X. \tag{1}
\]

In the case that $F : \mathbb{R}^n \to \mathbb{R}$ and $X = \mathbb{R}^n$, the steepest descent method with Armijo's rule
generates a sequence $\{p^k\}$ as follows

\[
p^{k+1} = p^k + t_k v^k, \qquad v^k = -F'(p^k), \qquad k = 0, 1, \ldots,
\]

where

\[
t_k = \max\{2^{-j} : F(p^k + 2^{-j} v^k) \le F(p^k) + \beta 2^{-j} \langle F'(p^k), v^k \rangle,\ j = 0, 1, \ldots\},
\]

with $\beta \in (0, 1)$. If $F$ is continuously differentiable, classical results assure only that the accumulation
points of $\{p^k\}$, if any exist, are critical points of $F$. This fact was generalized to multicriteria optimization by
Fliege and Svaiter [14], namely, to the case where the objective function is a vector function $F : \mathbb{R}^n \to \mathbb{R}^m$
and the partial order in $\mathbb{R}^m$ is the usual one, i.e., the component-wise order. Full convergence is assured
under the assumption that the solution set of problem (1) is nonempty and $F : \mathbb{R}^n \to \mathbb{R}$ is a
convex function, see Burachik et al. [6] (or, more generally, a quasi-convex function, see Kiwiel and
Murty [H]). This result has been generalized to vector optimization by Graña Drummond and Svaiter
[17] (see also Graña Drummond and Iusem [16]).
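
The scalar Euclidean iteration recalled above can be sketched in Python as follows. This is a minimal illustration and not code from the cited works: the function name `armijo_steepest_descent`, the stopping tolerance `tol`, the iteration cap `max_iter` and the default `beta = 0.5` are assumptions made for the example.

```python
import math

def armijo_steepest_descent(f, grad, p0, beta=0.5, tol=1e-8, max_iter=1000):
    """Scalar steepest descent in R^n with Armijo backtracking over t = 2^{-j}.

    f    : callable returning the objective value at a point (list of floats)
    grad : callable returning the Euclidean gradient at a point
    All names and default values here are illustrative assumptions.
    """
    p = list(p0)
    for _ in range(max_iter):
        g = grad(p)
        v = [-gi for gi in g]                          # v^k = -F'(p^k)
        slope = sum(gi * vi for gi, vi in zip(g, v))   # <F'(p^k), v^k> = -||g||^2
        if math.sqrt(-slope) <= tol:                   # numerically critical: stop
            break
        t = 1.0
        # Halve t until the Armijo sufficient-decrease test holds, which
        # yields the largest t = 2^{-j} satisfying it.
        while f([pi + t * vi for pi, vi in zip(p, v)]) > f(p) + beta * t * slope:
            t *= 0.5
        p = [pi + t * vi for pi, vi in zip(p, v)]
    return p
```

For instance, applied to the convex quadratic $f(x) = (x_1 - 1)^2 + 2(x_2 + 1/2)^2$ from the starting point $(3, 2)$, the sketch approaches the unique minimizer $(1, -1/2)$.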

The extension of concepts, techniques and methods from Euclidean spaces to Riemannian
manifolds is natural and, in general, nontrivial. In the last few years, such extensions, with
both practical and theoretical purposes, have been the subject of much new research. Recent works dealing
with this issue include [U El (H [H [TH (H [121 EH [SI E31 EH [Ml EH EH [38]. The generalization of
optimization methods from Euclidean spaces to Riemannian manifolds has some important advantages.
For example, constrained optimization problems can be seen as unconstrained ones from the
Riemannian geometry viewpoint (the constraint set is a manifold) and, in this case, we have an
alternative to the projection idea for solving the problem. Moreover, nonconvex
problems in the classical context may become convex through the introduction of an appropriate
Riemannian metric (see, for example, [8]).

The steepest descent method for problem (1), in the particular case that $X = M$ ($M$ a Riemannian
manifold) and $F : M \to \mathbb{R}$ is continuously differentiable, has been studied by Udriste [35],
Smith [34] and Rapcsak [31], and partial convergence results were obtained. For the convex case,
the full convergence result, using Armijo's rule, has been generalized by da Cruz Neto et al. [7] in the
particular case that $M$ has nonnegative curvature. Under the same restrictive assumption
on the manifold $M$, Papa Quiroz et al. [28] generalized the full convergence result, using a generalized
Armijo's rule, to the quasi-convex case.

In this paper, following the ideas of Fliege and Svaiter [14], we generalize their convergence results
for multicriteria optimization to the Riemannian context. Besides, following the ideas of Graña
Drummond and Svaiter [17], we generalize the full convergence result for multicriteria optimization
to the case that the multicriteria function is quasi-convex and the Riemannian manifold has
nonnegative curvature.

The organization of our paper is as follows. In Section 2 we define the notation and list some
results of Riemannian geometry to be used throughout this paper. In Section 3 we present the
multicriteria problem, its first-order optimality condition and some related basic definitions. In
Section 4 we state the Riemannian steepest descent method for solving multicriteria problems
and establish the well-definedness of the sequence generated by it. In Section 5 we prove a partial
convergence result without any additional assumption on $F$ besides continuous differentiability
and, assuming quasi-convexity of $F$ and nonnegative curvature of $M$, a full convergence result
is presented. Finally, in Section 6 we present some examples of complete Riemannian manifolds
with explicit geodesic curves and the steepest descent iteration of the sequence generated by the
proposed method.






2 Preliminaries on Riemannian geometry 

In this section, we introduce some fundamental properties and notation of Riemannian manifolds.
These basic facts can be found in any introductory book on Riemannian geometry, for example in
[9] and [33].

Let $M$ be an $n$-dimensional connected manifold. We denote by $T_pM$ the $n$-dimensional tangent
space of $M$ at $p$, by $TM = \bigcup_{p \in M} T_pM$ the tangent bundle of $M$ and by $\mathcal{X}(M)$ the space of smooth
vector fields over $M$. When $M$ is endowed with a Riemannian metric $\langle \cdot , \cdot \rangle$, with corresponding
norm denoted by $\| \cdot \|$, then $M$ is a Riemannian manifold. Recall that the metric can be used to
define the length of a piecewise smooth curve $\gamma : [a, b] \to M$ joining $p$ to $q$, i.e., such that $\gamma(a) = p$
and $\gamma(b) = q$, by

\[
\ell(\gamma) = \int_a^b \|\gamma'(t)\| \, dt,
\]

and, moreover, by minimizing this length functional over the set of all such curves, we obtain a
Riemannian distance $d(p, q)$ which induces the original topology on $M$. The metric induces a map
$f \mapsto \operatorname{grad} f \in \mathcal{X}(M)$ which associates to each smooth scalar function over $M$ its gradient via the
rule $\langle \operatorname{grad} f, X \rangle = df(X)$, $X \in \mathcal{X}(M)$. Let $\nabla$ be the Levi-Civita connection associated to $(M, \langle \cdot , \cdot \rangle)$.
A vector field $V$ along $\gamma$ is said to be parallel if $\nabla_{\gamma'} V = 0$. If $\gamma'$ itself is parallel we say that $\gamma$ is a
geodesic. Because the geodesic equation $\nabla_{\gamma'} \gamma' = 0$ is a second order nonlinear ordinary differential
equation, the geodesic $\gamma = \gamma_v(\cdot, p)$ is determined by its position $p$ and velocity $v$ at $p$. It is easy
to check that $\|\gamma'\|$ is constant. We say that $\gamma$ is normalized if $\|\gamma'\| = 1$. The restriction of a geodesic
to a closed bounded interval is called a geodesic segment. A geodesic segment joining $p$ to $q$ in $M$
is said to be minimal if its length equals $d(p, q)$, and this geodesic is called a minimizing geodesic.
If $\gamma$ is a curve joining points $p$ and $q$ in $M$ then, for each $t \in [a, b]$, $\nabla$ induces a linear isometry,
relative to $\langle \cdot , \cdot \rangle$, $P_{\gamma(a)\gamma(t)} : T_{\gamma(a)}M \to T_{\gamma(t)}M$, the so-called parallel transport along $\gamma$ from $\gamma(a)$ to
$\gamma(t)$. The inverse map of $P_{\gamma(a)\gamma(t)}$ is denoted by $P^{-1}_{\gamma(a)\gamma(t)} : T_{\gamma(t)}M \to T_{\gamma(a)}M$. In the particular case
that $\gamma$ is the unique curve joining points $p$ and $q$ in $M$, the parallel transport along $\gamma$ from $p$ to $q$ is
denoted by $P_{pq} : T_pM \to T_qM$.

A Riemannian manifold is complete if geodesics are defined for all values of $t$. The Hopf-Rinow
theorem asserts that if this is the case then any pair of points, say $p$ and $q$, in $M$ can be joined by
a (not necessarily unique) minimal geodesic segment. Moreover, $(M, d)$ is a complete metric space
and bounded and closed subsets are compact. Given $p \in M$, the exponential map $\exp_p : T_pM \to M$
is defined by $\exp_p v = \gamma_v(1, p)$.

We denote by $R$ the curvature tensor defined by $R(X, Y)Z = \nabla_X \nabla_Y Z - \nabla_Y \nabla_X Z - \nabla_{[X,Y]} Z$,
with $X, Y, Z \in \mathcal{X}(M)$, where $[X, Y] = YX - XY$. Then the sectional curvature with respect to $X$
and $Y$ is given by $K(X, Y) = \langle R(X, Y)Y, X \rangle / (\|X\|^2 \|Y\|^2 - \langle X, Y \rangle^2)$, where $\|X\| = \langle X, X \rangle^{1/2}$.

In Subsection 5.2 of this paper, we will be mainly interested in Riemannian manifolds for
which $K(X, Y) \ge 0$ for any $X, Y \in \mathcal{X}(M)$. Such manifolds are referred to as manifolds with
nonnegative curvature. A fundamental geometric property of this class of manifolds is that the
distance between geodesics issuing from one point is, at least locally, bounded from above by the
distance between the corresponding rays in the tangent space. A global formulation of this general
principle is the law of cosines that we now describe. A geodesic hinge in $M$ is a pair of
normalized geodesic segments $\gamma_1$ and $\gamma_2$ such that $\gamma_1(0) = \gamma_2(0)$ and at least one of them, say $\gamma_1$,
is minimal. From now on $l_1 = \ell(\gamma_1)$, $l_2 = \ell(\gamma_2)$, $l_3 = d(\gamma_1(l_1), \gamma_2(l_2))$ and $\alpha = \angle(\gamma_1'(0), \gamma_2'(0))$.

Theorem 1. (Law of cosines) In a complete Riemannian manifold with nonnegative curvature,
with the notation introduced above, we have

\[
l_3^2 \le l_1^2 + l_2^2 - 2 l_1 l_2 \cos \alpha. \tag{2}
\]

Proof. See [9] and [33]. □
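
As a numerical illustration of the law of cosines (2), the following Python sketch evaluates the gap between its right-hand and left-hand sides for a geodesic hinge on the unit sphere, a standard example of a complete manifold with nonnegative (constant, equal to one) curvature. The choice of the sphere, the hinge placed at the north pole and all function names are assumptions made for this example; they do not come from the paper.

```python
import math

def dot(x, y):
    return sum(a * b for a, b in zip(x, y))

def sphere_geodesic(p, u, t):
    """Point at arc length t on the unit-sphere geodesic starting at p with
    unit initial velocity u (u is assumed orthogonal to p)."""
    return [math.cos(t) * pi + math.sin(t) * ui for pi, ui in zip(p, u)]

def sphere_distance(x, y):
    """Riemannian distance on the unit sphere: the angle between x and y."""
    return math.acos(max(-1.0, min(1.0, dot(x, y))))

def hinge_gap(l1, l2, alpha):
    """l1^2 + l2^2 - 2*l1*l2*cos(alpha) - l3^2 for a hinge at the north pole.

    gamma_1 has length l1 (minimal when l1 <= pi), gamma_2 has length l2,
    and alpha is the angle between the initial velocities.
    """
    p = [0.0, 0.0, 1.0]
    u1 = [1.0, 0.0, 0.0]
    u2 = [math.cos(alpha), math.sin(alpha), 0.0]
    x = sphere_geodesic(p, u1, l1)
    y = sphere_geodesic(p, u2, l2)
    l3 = sphere_distance(x, y)
    return l1 * l1 + l2 * l2 - 2.0 * l1 * l2 * math.cos(alpha) - l3 * l3
```

Theorem 1 predicts that `hinge_gap` is nonnegative whenever `l1 <= pi`; in the Euclidean plane the corresponding gap would be identically zero.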

In this paper M will denote a complete n-dimensional Riemannian manifold. 



3 The multicriteria problem 

In this section we present the multicriteria problem, its first-order optimality condition and
some related basic definitions.

Let $I := \{1, \ldots, m\}$, $\mathbb{R}^m_+ = \{x \in \mathbb{R}^m : x_j \ge 0,\ j \in I\}$ and $\mathbb{R}^m_{++} = \{x \in \mathbb{R}^m : x_j > 0,\ j \in I\}$. For
$x, y \in \mathbb{R}^m$, $y \succeq x$ (or $x \preceq y$) means that $y - x \in \mathbb{R}^m_+$ and $y \succ x$ (or $x \prec y$) means that $y - x \in \mathbb{R}^m_{++}$.

Given a continuously differentiable vector function $F : M \to \mathbb{R}^m$, we consider the problem of
finding a Pareto optimum point of $F$, i.e., a point $p^* \in M$ such that there exists no other $p \in M$
with $F(p) \preceq F(p^*)$ and $F(p) \ne F(p^*)$. We denote this unconstrained problem in the Riemannian
context as

\[
\min_{p \in M} F(p). \tag{3}
\]

Let $F$ be given by $F(p) := (f_1(p), \ldots, f_m(p))$. We denote the Riemannian Jacobian of $F$ by

\[
\operatorname{grad} F(p) := (\operatorname{grad} f_1(p), \ldots, \operatorname{grad} f_m(p)), \qquad p \in M,
\]

and the image of the Riemannian Jacobian of $F$ at a point $p \in M$ by

\[
\operatorname{Im}(\operatorname{grad} F(p)) := \{\operatorname{grad} F(p) v = (\langle \operatorname{grad} f_1(p), v \rangle, \ldots, \langle \operatorname{grad} f_m(p), v \rangle) : v \in T_pM\}, \qquad p \in M.
\]

Using the above equality, the first-order optimality condition for problem (3) is stated as

\[
p \in M, \qquad \operatorname{Im}(\operatorname{grad} F(p)) \cap (-\mathbb{R}^m_{++}) = \emptyset. \tag{4}
\]

Remark 2. Note that the condition in (4) generalizes to vector optimization the classical condition
$\operatorname{grad} F(p) = 0$ for the scalar case, i.e., $m = 1$.

In general, (4) is necessary, but not sufficient, for optimality. So, a point $p \in M$ satisfying (4) is
called Pareto critical.
4 Steepest descent methods for multicriteria problems

In this section we state the Riemannian steepest descent method for solving multicriteria problems
and establish the well-definedness of the sequence generated by it.

Let $p \in M$ be a point which is not Pareto critical. Then there exists a direction $v \in T_pM$
satisfying

\[
\operatorname{grad} F(p) v \in -\mathbb{R}^m_{++},
\]

that is, $\operatorname{grad} F(p) v \prec 0$. In this case, $v$ is called a descent direction for $F$ at $p$.

For each $p \in M$, we consider the following unconstrained optimization problem in the tangent
plane $T_pM$:

\[
\min_{v \in T_pM} \left\{ \max_{i \in I} \langle \operatorname{grad} f_i(p), v \rangle + (1/2)\|v\|^2 \right\}, \qquad I = \{1, \ldots, m\}. \tag{5}
\]

Lemma 3. The unconstrained optimization problem in (5) has only one solution. Moreover, the
vector $v$ is the solution of the problem in (5) if and only if there exist $\alpha_i \ge 0$, $i \in I(p, v)$, such that

\[
v = -\sum_{i \in I(p,v)} \alpha_i \operatorname{grad} f_i(p), \qquad \sum_{i \in I(p,v)} \alpha_i = 1,
\]

where $I(p, v) := \{i \in I : \langle \operatorname{grad} f_i(p), v \rangle = \max_{i \in I} \langle \operatorname{grad} f_i(p), v \rangle\}$.
Proof. Since the function

\[
T_pM \ni v \mapsto \max_{i \in I} \langle \operatorname{grad} f_i(p), v \rangle
\]

is the maximum of linear functions in the linear space $T_pM$, it is convex. So, it is easy to see that
the function

\[
T_pM \ni v \mapsto \max_{i \in I} \langle \operatorname{grad} f_i(p), v \rangle + (1/2)\|v\|^2 \tag{6}
\]

is strongly convex, which implies that the problem in (5) has only one solution in $T_pM$, and the first
statement is proved.

From the convexity of the function in (6), it is well known that $v$ is the solution of the problem in (5) if
and only if

\[
0 \in \partial \left( \max_{i \in I} \langle \operatorname{grad} f_i(p), \cdot \rangle + (1/2)\| \cdot \|^2 \right)(v),
\]

or equivalently,

\[
-v \in \partial \left( \max_{i \in I} \langle \operatorname{grad} f_i(p), \cdot \rangle \right)(v).
\]

Therefore, the second statement follows from the formula for the subdifferential of the maximum of
convex functions (see [18], Volume I, Corollary VI.4.3.2). □
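
To make Lemma 3 concrete, the following Python sketch computes the solution of problem (5) for two objectives ($m = 2$). It is an illustration under stated assumptions rather than part of the paper: the tangent space is identified with $\mathbb{R}^n$ carrying the standard inner product (orthonormal coordinates), and the weights are obtained from the standard dual reformulation, in which $v = -(a g_1 + (1 - a) g_2)$ with $a \in [0, 1]$ minimizing $\|a g_1 + (1 - a) g_2\|^2$. The function name is hypothetical.

```python
def steepest_descent_direction_2(g1, g2):
    """Solve min_v max(<g1, v>, <g2, v>) + 0.5*||v||^2 for two gradient vectors.

    Returns (v, a) with v = -(a*g1 + (1 - a)*g2). The weight a is the
    minimizer over [0, 1] of ||a*g1 + (1 - a)*g2||^2, which has a closed
    form: the unconstrained minimizer clipped to the interval [0, 1].
    """
    diff = [x - y for x, y in zip(g1, g2)]            # g1 - g2
    denom = sum(d * d for d in diff)
    if denom == 0.0:
        a = 0.5                                       # g1 == g2: any weight works
    else:
        a = -sum(y * d for y, d in zip(g2, diff)) / denom
        a = min(1.0, max(0.0, a))
    v = [-(a * x + (1.0 - a) * y) for x, y in zip(g1, g2)]
    return v, a

def objective(g1, g2, v):
    """Value of the function in (6) at v, for two objectives."""
    ip1 = sum(x * y for x, y in zip(g1, v))
    ip2 = sum(x * y for x, y in zip(g2, v))
    return max(ip1, ip2) + 0.5 * sum(x * x for x in v)
```

For example, with $g_1 = (1, 0)$ and $g_2 = (0, 1)$ both indices are active, the weights are $(1/2, 1/2)$ and $v = (-1/2, -1/2)$. With $g_1 = (1, 0)$ and $g_2 = (-1, 0)$ the sketch returns $v = 0$, which corresponds to a point where no common descent direction exists.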

Lemma 4. If $p \in M$ is not Pareto critical of $F$ and $v$ is the solution of the problem in (5), then

\[
\max_{i \in I} \langle \operatorname{grad} f_i(p), v \rangle + (1/2)\|v\|^2 < 0.
\]

In particular, $v$ is a descent direction.

Proof. Since $p$ is not Pareto critical, there exists $\tilde{v} \in T_pM$ such that $\operatorname{grad} F(p) \tilde{v} \prec 0$. In
particular,

\[
\beta = \max_{i \in I} \langle \operatorname{grad} f_i(p), \tilde{v} \rangle < 0.
\]

As $-\beta / \|\tilde{v}\|^2 > 0$, letting $\hat{v} = (-\beta / \|\tilde{v}\|^2) \tilde{v}$ we obtain

\[
\max_{i \in I} \langle \operatorname{grad} f_i(p), \hat{v} \rangle + (1/2)\|\hat{v}\|^2 = -\frac{\beta^2}{2\|\tilde{v}\|^2} < 0.
\]

Using that $v$ is the solution of the problem in (5), the first part of the lemma follows from the last
inequality. The second part of the lemma is an immediate consequence of the first one. □

In view of the two previous lemmas and (5), we define the steepest descent direction function
for $F$ as follows.

Definition 5. The steepest descent direction function for $F$ is defined as

\[
M \ni p \mapsto v(p) := \operatorname{argmin}_{v \in T_pM} \left\{ \max_{i \in I} \langle \operatorname{grad} f_i(p), v \rangle + (1/2)\|v\|^2 \right\} \in T_pM.
\]

Remark 6. As an immediate consequence of Lemma 3, it follows that the steepest descent direction
for vector functions becomes the usual steepest descent direction when $m = 1$. See, for example, [?], ]EBj,
[31], [34] and [35]. In the case $M = \mathbb{R}^n$ we retrieve the steepest descent direction proposed in

The steepest descent method with Armijo rule for solving the unconstrained optimization problem
(3) is as follows:

Method 1 (Steepest descent method with Armijo rule).

Initialization. Take $\beta \in (0, 1)$ and $p^0 \in M$. Set $k = 0$.

Stop criterion. If $p^k$ is Pareto critical, STOP. Otherwise,

Iterative Step. Compute the steepest descent direction $v^k$ for $F$ at $p^k$, i.e.,

\[
v^k := v(p^k), \tag{7}
\]

and the steplength $t_k \in (0, 1]$ as follows:

\[
t_k := \max \left\{ 2^{-j} : j \in \mathbb{N},\ F\left(\exp_{p^k}(2^{-j} v^k)\right) \preceq F(p^k) + \beta 2^{-j} \operatorname{grad} F(p^k) v^k \right\}, \tag{8}
\]

and set

\[
p^{k+1} := \exp_{p^k}(t_k v^k), \tag{9}
\]

and GOTO Stop criterion.
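
The steps of Method 1 can be sketched in Python for two objectives as follows. This is a hedged illustration, not the authors' implementation: the function names, the tolerance `tol` used in place of the exact Pareto-criticality test (the sketch stops when $\|v(p)\|$ is numerically zero), the iteration cap `max_iter`, and the restriction to $m = 2$ with a closed-form direction are all assumptions. The user supplies the exponential map `exp_map(p, v)`, the metric `inner(p, u, w)` and the Riemannian gradients `grads(p)` in coordinates.

```python
import math

def direction_two_objectives(g1, g2, ip):
    """v(p) for m = 2: v = -(a*g1 + (1-a)*g2), where a in [0, 1] minimizes
    the norm of a*g1 + (1-a)*g2 with respect to the inner product ip."""
    diff = [x - y for x, y in zip(g1, g2)]            # g1 - g2
    denom = ip(diff, diff)
    if denom == 0.0:
        a = 0.5
    else:
        a = min(1.0, max(0.0, -ip(g2, diff) / denom))
    return [-(a * x + (1.0 - a) * y) for x, y in zip(g1, g2)]

def multicriteria_steepest_descent(F, grads, exp_map, inner, p0,
                                   beta=0.5, tol=1e-8, max_iter=500):
    """Sketch of Method 1 for F = (f1, f2) on a manifold given in coordinates."""
    p = list(p0)
    for _ in range(max_iter):
        g1, g2 = grads(p)
        ip = lambda u, w, base=p: inner(base, u, w)   # metric at the current point
        v = direction_two_objectives(g1, g2, ip)      # step (7)
        if math.sqrt(ip(v, v)) <= tol:                # stop criterion (numerical)
            break
        slopes = [ip(g1, v), ip(g2, v)]               # components of grad F(p) v
        Fp = F(p)
        t = 1.0
        while True:                                   # Armijo rule (8)
            q = exp_map(p, [t * vi for vi in v])
            Fq = F(q)
            if all(fq <= fp + beta * t * s for fq, fp, s in zip(Fq, Fp, slopes)):
                break
            t *= 0.5
        p = q                                         # update (9)
    return p
```

With `exp_map(p, v) = p + v` and the standard inner product, the sketch reduces to the Euclidean multicriteria method. For $f_1(x, y) = (x - 1)^2 + y^2$ and $f_2(x, y) = (x + 1)^2 + y^2$, whose Pareto critical points form the segment $[-1, 1] \times \{0\}$, it stops at a point of that segment.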

Remark 7. The steepest descent method for vector optimization in Riemannian manifolds becomes 
the classical steepest descent method when m = 1, which has appeared, for example, in J?]/, Jffij/ . 
and JSf. 
Proposition 8. The sequence $\{p^k\}$ generated by the steepest descent method with Armijo rule is well
defined.

Proof. Assume that $p^k$ is not Pareto critical. From Definition 5 and Lemma 3, $v^k = v(p^k)$ is well
defined. Thus, to prove that the proposed method is well defined, it is enough to prove that
the steplength is well defined. For this, first note that from Definition 5 and Lemma 4

\[
\operatorname{grad} F(p^k) v^k \prec 0.
\]

Since $F : M \to \mathbb{R}^m$ is a continuously differentiable vector function, $\operatorname{grad} F(p^k) v^k \prec 0$ and $\beta \in (0, 1)$,
we have

\[
\lim_{t \to 0^+} \frac{F\left(\exp_{p^k}(t v^k)\right) - F(p^k)}{t} = \operatorname{grad} F(p^k) v^k \prec \beta \operatorname{grad} F(p^k) v^k \prec 0.
\]

Therefore, it is straightforward to show that there exists $\delta \in (0, 1]$ such that

\[
F\left(\exp_{p^k}(t v^k)\right) \prec F(p^k) + \beta t \operatorname{grad} F(p^k) v^k, \qquad t \in (0, \delta).
\]

As $\lim_{j \to \infty} 2^{-j} = 0$, the last vector inequality implies that the steplength (8) is well defined. Hence
$p^{k+1}$ is also well defined and the proposition is proved. □

5 Convergence analysis 

In this section, following the ideas of [14], we prove a partial convergence result without any additional
assumption on $F$ besides continuous differentiability. Then, following [17] and assuming
quasi-convexity of $F$ and nonnegative curvature of $M$, we extend to the optimization of vector functions
the full convergence result presented in [7] and [28]. It is immediate to see that, if Method 1
terminates after a finite number of iterations, it terminates at a Pareto critical point. From now
on, we will assume that $\{p^k\}$, $\{v^k\}$ and $\{t_k\}$ are infinite sequences generated by Method 1.

5.1 Partial convergence result 

In this subsection we prove that every accumulation point of $\{p^k\}$ is a Pareto critical point. Before
this, we prove the following preliminary fact that will be useful.

Lemma 9. The steepest descent direction function for $F$, $M \ni p \mapsto v(p) \in T_pM$, is continuous.

Proof. Let $\{q^k\} \subset M$ be a sequence which converges to $\bar{q}$ as $k$ goes to $+\infty$, and $U_{\bar{q}} \subset M$ a
neighborhood of $\bar{q}$ such that $TU_{\bar{q}} \simeq U_{\bar{q}} \times \mathbb{R}^n$. Since $\{q^k\}$ converges to $\bar{q}$ and $TU_{\bar{q}} \subset TM$ is an open
set, we assume that the whole sequence $\{(q^k, v(q^k))\}$ is in $TU_{\bar{q}}$. Define $v^k := v(q^k)$. Combining
Definition 5 with Lemma 4, it is easy to see that

\[
\|v^k\| \le 2 \max_{i \in I} \|\operatorname{grad} f_i(q^k)\|.
\]

As $F$ is continuously differentiable and $\{q^k\}$ is convergent, the above inequality implies that the
sequence $\{v^k\}$ is bounded. Let $\bar{v}$ be an accumulation point of the sequence $\{v^k\}$. From Definition 5
and Lemma 3 we conclude that there exist $\alpha_i^k \ge 0$, $i \in I(q^k, v^k)$, such that

\[
v^k = -\sum_{i \in I(q^k, v^k)} \alpha_i^k \operatorname{grad} f_i(q^k), \qquad \sum_{i \in I(q^k, v^k)} \alpha_i^k = 1, \qquad k = 0, 1, \ldots, \tag{10}
\]

where $I(q^k, v^k) := \{i \in I : \langle \operatorname{grad} f_i(q^k), v^k \rangle = \max_{i \in I} \langle \operatorname{grad} f_i(q^k), v^k \rangle\}$. Using the above constants and
the associated indexes, define the sequence $\{\alpha^k\}$ as

\[
\alpha^k := (\alpha_1^k, \ldots, \alpha_m^k), \qquad \alpha_i^k = 0, \quad i \in I \setminus I(q^k, v^k), \qquad k = 0, 1, \ldots.
\]

Let $\| \cdot \|_1$ be the sum norm in $\mathbb{R}^m$. Since $\sum_{i \in I(q^k, v^k)} \alpha_i^k = 1$, we have $\|\alpha^k\|_1 = 1$ for all $k$, which
implies that the sequence $\{\alpha^k\}$ is bounded. Let $\bar{\alpha}$ be an accumulation point of the sequence $\{\alpha^k\}$.
Let $\{v^{k_s}\}$ and $\{\alpha^{k_s}\}$ be subsequences of $\{v^k\}$ and $\{\alpha^k\}$, respectively, such that

\[
\lim_{s \to +\infty} v^{k_s} = \bar{v}, \qquad \lim_{s \to +\infty} \alpha^{k_s} = \bar{\alpha}.
\]

As the index set $I$ is finite and $I(q^{k_s}, v^{k_s}) \subset I$ for all $s$, we assume without loss of generality that

\[
I(q^{k_1}, v^{k_1}) = I(q^{k_2}, v^{k_2}) = \cdots = \bar{I}. \tag{11}
\]

Hence, we conclude from (10) and the last equalities that

\[
v^{k_s} = -\sum_{i \in \bar{I}} \alpha_i^{k_s} \operatorname{grad} f_i(q^{k_s}), \qquad \sum_{i \in \bar{I}} \alpha_i^{k_s} = 1, \qquad s = 0, 1, \ldots.
\]

Letting $s$ go to $+\infty$ in the above equalities, we obtain

\[
\bar{v} = -\sum_{i \in \bar{I}} \bar{\alpha}_i \operatorname{grad} f_i(\bar{q}), \qquad \sum_{i \in \bar{I}} \bar{\alpha}_i = 1. \tag{12}
\]

On the other hand, $I(q^{k_s}, v^{k_s}) = \{i \in I : \langle \operatorname{grad} f_i(q^{k_s}), v^{k_s} \rangle = \max_{i \in I} \langle \operatorname{grad} f_i(q^{k_s}), v^{k_s} \rangle\}$. So,
equation (11) implies that

\[
\langle \operatorname{grad} f_i(q^{k_s}), v^{k_s} \rangle = \max_{i \in I} \langle \operatorname{grad} f_i(q^{k_s}), v^{k_s} \rangle, \qquad i \in \bar{I}, \quad s = 0, 1, \ldots.
\]

Using the continuity of $\operatorname{grad} F$ and the last equality we have

\[
\langle \operatorname{grad} f_i(\bar{q}), \bar{v} \rangle = \max_{i \in I} \langle \operatorname{grad} f_i(\bar{q}), \bar{v} \rangle, \qquad i \in \bar{I}.
\]

From the definition of $I(\bar{q}, \bar{v})$ we obtain $\bar{I} \subset I(\bar{q}, \bar{v})$. Therefore, combining again Definition 5 with
Lemma 3 and (12), we conclude that $\bar{v} = v(\bar{q})$ and the desired result is proved. □
In the next result, we use only that $F$ is continuously differentiable to assure that the sequence
of functional values $\{F(p^k)\}$ is monotonically decreasing and that the
accumulation points of $\{p^k\}$ are Pareto critical.

Theorem 10. The following statements hold:

i) $\{F(p^k)\}$ is decreasing;

ii) Each accumulation point of the sequence $\{p^k\}$ is a Pareto critical point.
Proof. The iterative step in Method 1 implies that

\[
F(p^{k+1}) \preceq F(p^k) + \beta t_k \operatorname{grad} F(p^k) v^k, \qquad p^{k+1} = \exp_{p^k} t_k v^k, \qquad k = 0, 1, \ldots. \tag{13}
\]

Since $\{p^k\}$ is an infinite sequence, for all $k$, $p^k$ is not Pareto critical of $F$. Thus, item i follows from the
definition of $v^k$ together with Definition 5, Lemma 4 and the last vector inequality.

Let $\bar{p} \in M$ be an accumulation point of the sequence $\{p^k\}$ and $\{p^{k_s}\}$ a subsequence of $\{p^k\}$ such
that $\lim_{s \to +\infty} p^{k_s} = \bar{p}$. Since $F$ is continuous and $\lim_{s \to +\infty} p^{k_s} = \bar{p}$, we have $\lim_{s \to +\infty} F(p^{k_s}) = F(\bar{p})$.
So, taking into account that $\{F(p^k)\}$ is a decreasing sequence and has $F(\bar{p})$ as an accumulation
point, it is easy to conclude that the whole sequence $\{F(p^k)\}$ converges to $F(\bar{p})$. Using
(13), Definition 5 and Lemma 4 we conclude that

\[
F(p^{k+1}) - F(p^k) \preceq \beta t_k \operatorname{grad} F(p^k) v^k \prec 0, \qquad k = 0, 1, \ldots.
\]

Since $\lim_{k \to +\infty} F(p^k) = F(\bar{p})$, the last inequality implies that

\[
\lim_{k \to +\infty} \beta t_k \operatorname{grad} F(p^k) v^k = 0. \tag{14}
\]

As $\{p^{k_s}\}$ converges to $\bar{p}$, we assume that $\{(p^{k_s}, v^{k_s})\} \subset TU_{\bar{p}}$, where $U_{\bar{p}}$ is a neighborhood of $\bar{p}$ such
that $TU_{\bar{p}} \simeq U_{\bar{p}} \times \mathbb{R}^n$. Moreover, as the sequence $\{t_k\} \subset (0, 1]$ has an accumulation point $\bar{t} \in [0, 1]$,
we assume without loss of generality that $\{t_{k_s}\}$ converges to $\bar{t}$. We have two possibilities to consider:

a) $\bar{t} > 0$;

b) $\bar{t} = 0$.

Assume that item a holds. In this case, from (14), the continuity of $\operatorname{grad} F$, (7) and Lemma 9 we
obtain

\[
\operatorname{grad} F(\bar{p}) v(\bar{p}) = 0,
\]

which implies that

\[
\max_{i \in I} \langle \operatorname{grad} f_i(\bar{p}), v(\bar{p}) \rangle = 0. \tag{15}
\]

On the other hand, from Definition 5 together with Lemma 4,

\[
\max_{i \in I} \langle \operatorname{grad} f_i(p^{k_s}), v^{k_s} \rangle + (1/2)\|v^{k_s}\|^2 < 0.
\]

Letting $s$ go to $+\infty$ in the above inequalities and using Lemma 9 combined with the continuity
of $\operatorname{grad} F$ and equality (15), we conclude that

\[
\max_{i \in I} \langle \operatorname{grad} f_i(\bar{p}), v(\bar{p}) \rangle + (1/2)\|v(\bar{p})\|^2 = 0.
\]

Hence, it follows from the last equality, Definition 5 and Lemma 4 that $\bar{p}$ is Pareto critical.

Now, assume that item b holds. Since, for all $s$, $p^{k_s}$ is not Pareto critical, we have

\[
\max_{i \in I} \langle \operatorname{grad} f_i(p^{k_s}), v^{k_s} \rangle \le \max_{i \in I} \langle \operatorname{grad} f_i(p^{k_s}), v^{k_s} \rangle + (1/2)\|v^{k_s}\|^2 < 0,
\]

where the last inequality is a consequence of Definition 5 together with Lemma 4. Hence, letting
$s$ go to $+\infty$ in the last inequalities, using (7) and Lemma 9 we obtain

\[
\max_{i \in I} \langle \operatorname{grad} f_i(\bar{p}), v(\bar{p}) \rangle \le \max_{i \in I} \langle \operatorname{grad} f_i(\bar{p}), v(\bar{p}) \rangle + (1/2)\|v(\bar{p})\|^2 \le 0. \tag{16}
\]

Take $r \in \mathbb{N}$. Since $\{t_{k_s}\}$ converges to $\bar{t} = 0$, we conclude that for $s$ large enough,

\[
t_{k_s} < 2^{-r}.
\]

From (8) this means that the Armijo condition (13) is not satisfied for $t = 2^{-r}$, i.e.,

\[
F\left(\exp_{p^{k_s}}(2^{-r} v^{k_s})\right) \not\preceq F(p^{k_s}) + \beta 2^{-r} \operatorname{grad} F(p^{k_s}) v^{k_s},
\]

which means that there exists at least one $i_0 \in I$ such that

\[
f_{i_0}\left(\exp_{p^{k_s}}(2^{-r} v^{k_s})\right) > f_{i_0}(p^{k_s}) + \beta 2^{-r} \langle \operatorname{grad} f_{i_0}(p^{k_s}), v^{k_s} \rangle.
\]

Letting $s$ go to $+\infty$ in the above inequality, taking into account that $\operatorname{grad} F$ and $\exp$ are continuous
and using Lemma 9, we obtain

\[
f_{i_0}\left(\exp_{\bar{p}}(2^{-r} v(\bar{p}))\right) \ge f_{i_0}(\bar{p}) + \beta 2^{-r} \langle \operatorname{grad} f_{i_0}(\bar{p}), v(\bar{p}) \rangle.
\]

The last inequality is equivalent to

\[
\frac{f_{i_0}\left(\exp_{\bar{p}}(2^{-r} v(\bar{p}))\right) - f_{i_0}(\bar{p})}{2^{-r}} \ge \beta \langle \operatorname{grad} f_{i_0}(\bar{p}), v(\bar{p}) \rangle,
\]

which, letting $r$ go to $+\infty$ and using that $0 < \beta < 1$, yields $\langle \operatorname{grad} f_{i_0}(\bar{p}), v(\bar{p}) \rangle \ge 0$. Hence,

\[
\max_{i \in I} \langle \operatorname{grad} f_i(\bar{p}), v(\bar{p}) \rangle \ge 0.
\]

Combining the last inequality with (16), we have

\[
\max_{i \in I} \langle \operatorname{grad} f_i(\bar{p}), v(\bar{p}) \rangle + (1/2)\|v(\bar{p})\|^2 = 0.
\]

Therefore, again from Definition 5 and Lemma 4 it follows that $\bar{p}$ is Pareto critical and the proof
is concluded. □
Remark 11. If the sequence $\{p^k\}$ begins in a bounded level set, for example, if

\[
L_F(F(p^0)) := \{p \in M : F(p) \preceq F(p^0)\}
\]

is a bounded set, then, since $F$ is a continuous function, the Hopf-Rinow theorem assures that $L_F(F(p^0))$
is a compact set. So, item i of Theorem 10 implies that $\{p^k\} \subset L_F(F(p^0))$ and consequently $\{p^k\}$ is
bounded. In particular, $\{p^k\}$ has at least one accumulation point. Therefore, Theorem 10 extends
to vector optimization the results of Theorem 5.1 of [?]. See also Remark 4.5 of fJE/.

5.2 Full convergence 

In this subsection, under the quasi-convexity assumption on $F$ and nonnegative curvature of $M$, full
convergence of the steepest descent method is obtained.

Definition 12. Let $H : M \to \mathbb{R}^m$ be a vector function.

i) $H$ is called convex on $M$ if for every $p, q \in M$ and every geodesic segment $\gamma : [0, 1] \to M$
joining $p$ to $q$ (i.e., $\gamma(0) = p$ and $\gamma(1) = q$), it holds

\[
H(\gamma(t)) \preceq (1 - t)H(p) + tH(q), \qquad t \in [0, 1].
\]

ii) $H$ is called quasi-convex on $M$ if for every $p, q \in M$ and every geodesic segment $\gamma : [0, 1] \to M$
joining $p$ to $q$, it holds

\[
H(\gamma(t)) \preceq \max\{H(p), H(q)\}, \qquad t \in [0, 1],
\]

where the maximum is taken coordinate by coordinate.

Remark 13. The first definition above is a natural extension of the definition of convexity, while
the second is an extension of a characterization of quasi-convexity, from the Euclidean
space to the Riemannian context. See Definition 6.2 and Corollary 6.6 of \2$ , pages 29 and 31
respectively. Note that the above definitions are equivalent, respectively, to $H$ being convex and
quasi-convex along every geodesic segment. Thus, when $m = 1$ these definitions reduce to the scalar
convexity and quasi-convexity defined in [35], respectively. Moreover, it is immediate from the above
definitions that if $H$ is convex then it is quasi-convex. In the case that $H$ is differentiable, convexity
of $H$ implies that for every $p, q \in M$ and every geodesic segment $\gamma : [0, 1] \to M$ such that $\gamma(0) = p$
and $\gamma(1) = q$,

\[
\operatorname{grad} H(p) \gamma'(0) \preceq H(q) - H(p).
\]

Proposition 14. Let $H : M \to \mathbb{R}^m$ be a differentiable quasi-convex function. Then, for every
$p, q \in M$ and every geodesic segment $\gamma : [0, 1] \to M$ joining $p$ to $q$, it holds

\[
H(q) \preceq H(p) \implies \operatorname{grad} H(p) \gamma'(0) \preceq 0.
\]

Proof. Take $p, q \in M$ such that $H(q) \preceq H(p)$ and a geodesic segment $\gamma : [0, 1] \to M$ such that
$\gamma(0) = p$ and $\gamma(1) = q$. Since $H$ is quasi-convex, we have

\[
H(\gamma(t)) \preceq H(p), \qquad t \in [0, 1].
\]

Using the last inequality, the result is an immediate consequence of the differentiability of $H$. □

We know that criticality is a necessary, but not sufficient, condition for optimality. However, under
convexity of the vector function $F$, we prove that criticality is equivalent to weak optimality.

Definition 15. A point $p^* \in M$ is a weak Pareto optimal point of $F$ if there is no $p \in M$ with
$F(p) \prec F(p^*)$.

Proposition 16. Let $H : M \to \mathbb{R}^m$ be a convex continuously differentiable function. Then, $p \in M$
is a Pareto critical point of $H$, i.e.,

\[
\operatorname{Im}(\operatorname{grad} H(p)) \cap (-\mathbb{R}^m_{++}) = \emptyset,
\]

if and only if $p$ is a weak Pareto optimal point of $H$.

Proof. Let us suppose that $p$ is Pareto critical of $H$. Assume by contradiction that $p$ is not weak
Pareto optimal of $H$. Since $p$ is not weak Pareto optimal, there exists $\tilde{p} \in M$ such that

\[
H(\tilde{p}) \prec H(p). \tag{17}
\]

Let $\gamma : [0, 1] \to M$ be a geodesic segment joining $p$ to $\tilde{p}$ (i.e., $\gamma(0) = p$ and $\gamma(1) = \tilde{p}$). As $H$ is
differentiable and convex, the last part of Remark 13 and (17) imply that

\[
\operatorname{grad} H(p) \gamma'(0) \preceq H(\tilde{p}) - H(p) \prec 0.
\]

But this contradicts the fact that $p$ is Pareto critical of $H$, and the first part is concluded.

Now, let us suppose that $p$ is weak Pareto optimal of $H$. Assume by contradiction that $p$ is not
Pareto critical of $H$. Since $p$ is not Pareto critical, $\operatorname{Im}(\operatorname{grad} H(p)) \cap (-\mathbb{R}^m_{++}) \ne \emptyset$, that is, there
exists $v \in T_pM$, a descent direction for $H$ at $p$. Hence, from the differentiability of $H$, we have

\[
\lim_{t \to 0^+} \frac{H(\exp_p(tv)) - H(p)}{t} = \operatorname{grad} H(p) v \prec 0,
\]

which implies that there exists $\delta > 0$ such that

\[
H(\exp_p(tv)) \prec H(p), \qquad t \in (0, \delta),
\]

contradicting the fact that $p$ is weak Pareto optimal of $H$, which concludes the proof. □
Definition 17. A sequence $\{q^k\} \subset M$ is quasi-Fejér convergent to a nonempty set $U$ if, for all
$q \in U$, there exists a sequence $\{\epsilon_k\} \subset \mathbb{R}_+$ such that

\[
\sum_{k=0}^{+\infty} \epsilon_k < +\infty, \qquad d^2(q^{k+1}, q) \le d^2(q^k, q) + \epsilon_k, \qquad k = 0, 1, \ldots.
\]

In the next lemma we recall the so-called quasi-Fejér convergence theorem.

Lemma 18. Let $U \subset M$ be a nonempty set and $\{q^k\} \subset M$ a sequence quasi-Fejér convergent to $U$.
Then, $\{q^k\}$ is bounded. Moreover, if an accumulation point $\bar{q}$ of $\{q^k\}$ belongs to $U$, then the whole
sequence $\{q^k\}$ converges to $\bar{q}$ as $k$ goes to $+\infty$.

Proof. Analogous to the proof of Theorem 1 in Burachik et al. [6], replacing the Euclidean
distance by the Riemannian distance $d$. □

Consider the following set

\[
U := \{p \in M : F(p) \preceq F(p^k),\ k = 0, 1, \ldots\}. \tag{18}
\]

In general, the above set may be empty. To guarantee that $U$ is nonempty, an additional
assumption on the sequence $\{p^k\}$ is needed. In the next remark we give such a condition.

Remark 19. If the sequence $\{p^k\}$ has an accumulation point, then $U$ is nonempty. Indeed, let $\bar{p}$ be
an accumulation point of the sequence $\{p^k\}$. Then, there exists a subsequence $\{p^{k_j}\}$ of $\{p^k\}$ which
converges to $\bar{p}$. Since $F$ is continuous, $\{F(p^k)\}$ has $F(\bar{p})$ as an accumulation point. Hence, using
that $\{F(p^k)\}$ is a decreasing sequence (see item i of Theorem 10), usual arguments show that
the whole sequence $\{F(p^k)\}$ converges to $F(\bar{p})$ and there holds

\[
F(\bar{p}) \preceq F(p^k), \qquad k = 0, 1, \ldots,
\]

which implies that $\bar{p} \in U$, i.e., $U \ne \emptyset$.

In the next lemma we present the main result of this section. It is fundamental in the proof
of the global convergence of the sequence $\{p^k\}$.

Lemma 20. Suppose that $F$ is quasi-convex, $M$ has nonnegative curvature and $U$, defined in (18),
is nonempty. Then, for all $p \in U$, the following inequality holds:

\[
d^2(p^{k+1}, p) \le d^2(p^k, p) + t_k^2 \|v^k\|^2.
\]

Proof. Consider the geodesic hinge $(\gamma_1, \gamma_2, \alpha)$, where $\gamma_1$ is a normalized minimal geodesic segment
joining $p^k$ to $p$, $\gamma_2$ is the geodesic segment joining $p^k$ to $p^{k+1}$ such that $\gamma_2'(0) = t_k v^k$ and
$\alpha = \angle(\gamma_1'(0), v^k)$. By the law of cosines (Theorem 1), we have

\[
d^2(p^{k+1}, p) \le d^2(p^k, p) + t_k^2 \|v^k\|^2 - 2 d(p^k, p) t_k \|v^k\| \cos \alpha, \qquad k = 0, 1, \ldots.
\]

Thus, taking into account that $\cos(\pi - \alpha) = -\cos \alpha$ and $\langle -v^k, \gamma_1'(0) \rangle = \|v^k\| \cos(\pi - \alpha)$, the above
inequality becomes

\[
d^2(p^{k+1}, p) \le d^2(p^k, p) + t_k^2 \|v^k\|^2 + 2 d(p^k, p) t_k \langle -v^k, \gamma_1'(0) \rangle, \qquad k = 0, 1, \ldots.
\]

On the other hand, from (7), Definition 5 and Lemma 3 there exist $\alpha_i^k \ge 0$, with $i \in I_k := I(p^k, v^k)$,
such that

\[
v^k = -\sum_{i \in I_k} \alpha_i^k \operatorname{grad} f_i(p^k), \qquad \sum_{i \in I_k} \alpha_i^k = 1, \qquad k = 0, 1, \ldots.
\]

Hence, the last inequality yields

\[
d^2(p^{k+1}, p) \le d^2(p^k, p) + t_k^2 \|v^k\|^2 + 2 d(p^k, p) t_k \sum_{i \in I_k} \alpha_i^k \langle \operatorname{grad} f_i(p^k), \gamma_1'(0) \rangle, \qquad k = 0, 1, \ldots. \tag{19}
\]

Since $F$ is quasi-convex and $p \in U$, from Proposition 14 with $H = F$, $p = p^k$, $q = p$ and $\gamma = \gamma_1$,
we have

\[
\operatorname{grad} F(p^k) \gamma_1'(0) \preceq 0, \qquad k = 0, 1, \ldots,
\]

or equivalently,

\[
\langle \operatorname{grad} f_i(p^k), \gamma_1'(0) \rangle \le 0, \qquad i = 1, \ldots, m, \qquad k = 0, 1, \ldots. \tag{20}
\]

Therefore, combining (19) with (20), the lemma follows. □

Proposition 21. If $F$ is quasi-convex, $M$ has nonnegative curvature and $U$, defined in (18), is a
nonempty set, then the sequence $\{p^k\}$ is quasi-Fejér convergent to $U$.

Proof. To simplify the notation, define the scalar function $\varphi : \mathbb{R}^m \to \mathbb{R}$ as follows:

\[
\varphi(y) = \max_{i \in I} \langle y, e_i \rangle, \qquad I = \{1, \ldots, m\},
\]

where $\{e_i\} \subset \mathbb{R}^m$ is the canonical basis of the space $\mathbb{R}^m$. It is easy to see that the following properties
of the function $\varphi$ hold:

\[
\varphi(x + y) \le \varphi(x) + \varphi(y), \qquad \varphi(tx) = t\varphi(x), \qquad x, y \in \mathbb{R}^m, \quad t \ge 0. \tag{21}
\]

\[
x \preceq y \implies \varphi(x) \le \varphi(y), \qquad x, y \in \mathbb{R}^m. \tag{22}
\]

From the definition of $t_k$ in (8) and $p^{k+1}$ in (9), we have

\[
F(p^{k+1}) \preceq F(p^k) + \beta t_k \operatorname{grad} F(p^k) v^k, \qquad k = 0, 1, \ldots.
\]

Hence, using (21), (22) and the last inequality, we obtain

\[
\varphi(F(p^{k+1})) \le \varphi(F(p^k)) + \beta t_k \varphi(\operatorname{grad} F(p^k) v^k), \qquad k = 0, 1, \ldots. \tag{23}
\]

On the other hand, combining the definition of $v^k$ in (7), Definition 5, Lemma 4 and the definition of $\varphi$,
we conclude that

\[
\varphi(\operatorname{grad} F(p^k) v^k) + (1/2)\|v^k\|^2 < 0, \qquad k = 0, 1, \ldots,
\]

which together with (23) implies that

\[
\varphi(F(p^{k+1})) < \varphi(F(p^k)) - (\beta t_k / 2)\|v^k\|^2, \qquad k = 0, 1, \ldots.
\]

But this tells us that

\[
t_k \|v^k\|^2 < 2[\varphi(F(p^k)) - \varphi(F(p^{k+1}))]/\beta, \qquad k = 0, 1, \ldots.
\]

As $t_k \in (0, 1]$, it follows that

\[
t_k^2 \|v^k\|^2 < 2[\varphi(F(p^k)) - \varphi(F(p^{k+1}))]/\beta, \qquad k = 0, 1, \ldots.
\]

Thus, summing the latter inequality over $k$, we obtain

\[
\sum_{k=0}^{n} t_k^2 \|v^k\|^2 < 2[\varphi(F(p^0)) - \varphi(F(p^{n+1}))]/\beta, \qquad n \ge 0.
\]

Take $p \in U$. Then, $F(p) \preceq F(p^{n+1})$. So, from (22), $\varphi(F(p)) \le \varphi(F(p^{n+1}))$ and the last inequality
yields

\[
\sum_{k=0}^{n} t_k^2 \|v^k\|^2 < 2(\varphi(F(p^0)) - \varphi(F(p)))/\beta,
\]

which implies that $\{t_k^2 \|v^k\|^2\}$ is a summable sequence. Therefore, the desired result follows from
Lemma 20 combined with Definition 17. □



Theorem 22. // F is quasi-convex, M has not-negative curvature and U, defined in (|18p . is a 
nonempty set, then the sequence {pk} converges to a Pareto critical of F. 

Proof. From Proposition 21, $\{p^k\}$ is Fejér convergent to $U$. Thus Lemma 18 guarantees that $\{p^k\}$ is
bounded and, from the Hopf-Rinow theorem, there exists a subsequence $\{p^{k_s}\}$ of $\{p^k\}$ which converges
to some $p \in M$ as $s$ goes to $+\infty$. Since $F$ is continuous and $\{F(p^k)\}$ is a decreasing sequence (see item
i of Theorem 10), we conclude that $F(p^k)$ converges to $F(p)$ as $k$ goes to $+\infty$, which implies that

$$F(p) \preceq F(p^k), \qquad k = 0, 1, \ldots,$$

i.e., $p \in U$. Hence, from Lemma 18 we conclude that the whole sequence $\{p^k\}$ converges to $p$ as $k$
goes to $+\infty$, and the conclusion of the proof is a consequence of item ii of Theorem 10. □

Corollary 23. If $F$ is convex, $M$ has non-negative curvature and $U$, defined in (18), is a nonempty
set, then the sequence $\{p^k\}$ converges to a weak Pareto optimal point of $F$.

Proof. Since $F$ is convex, it is, in particular, quasi-convex (see Remark 13). Thus the corollary is a
consequence of the previous theorem and Proposition 16. □

6 Examples



In this section we present some examples of complete Riemannian manifolds with explicit geodesic
curves, together with the steepest descent iteration of the sequence generated by Method 1. We recall that
$F : M \to \mathbb{R}^m$, $F(p) := (f_1(p), \ldots, f_m(p))$, is a differentiable function. If $(M, G)$ is a Riemannian
manifold, then the Riemannian gradient of $f_i$ is given by $\mathrm{grad}\, f_i(p) = G(p)^{-1} f_i'(p)$, $i \in I :=
\{1, \ldots, m\}$. Hence, if $v = v(p)$ is the steepest descent direction for $F$ at $p$ (see Definition 5), then, by
the lemma characterizing this direction, there exist constants $\alpha_i \ge 0$, $i \in I(p, v)$, such that

$$v = -\sum_{i \in I(p,v)} \alpha_i\, G(p)^{-1} f_i'(p), \qquad \sum_{i \in I(p,v)} \alpha_i = 1, \tag{24}$$

where $I(p, v) := \{i \in I : \langle G(p)^{-1} f_i'(p), v \rangle = \max_{i \in I} \langle G(p)^{-1} f_i'(p), v \rangle\}$.



6.1 A steepest descent method for $\mathbb{R}^n_{++}$

Let $M$ be the positive orthant, $\mathbb{R}^n_{++}$, endowed with the Riemannian metric

$$M \ni p \mapsto G(p) = P^{-2} := \mathrm{diag}\left(p_1^{-2}, \ldots, p_n^{-2}\right)$$

(the metric induced by the Hessian of the logarithmic barrier). Since $(M, G)$ is isometric to the
Euclidean space endowed with the usual metric (see Da Cruz Neto et al. [8]), it follows that $M$ has
constant curvature equal to zero. On the other hand, it is easy to see that the unique geodesic $p = p(t)$
such that $p(0) = p^0 = (p_1^0, \ldots, p_n^0)$ and $p'(0) = v^0 = (v_1^0, \ldots, v_n^0)$ is given by $p(t) = (p_1(t), \ldots, p_n(t))$,
where

$$p_j(t) = p_j^0\, e^{(v_j^0/p_j^0)\,t}, \qquad j = 1, \ldots, n. \tag{25}$$

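Since these geodesics are defined for all $t \in \mathbb{R}$, we conclude that $(M, G)$ is also complete. In this
case, from (25) and (24), there exist $\alpha_i^k \ge 0$ such that the steepest descent iteration of the sequence
generated by Method 1 is given by

$$p_j^{k+1} = p_j^k\, e^{(v_j^k/p_j^k)\,t_k}, \qquad v_j^k = -\sum_{i \in I(p^k, v^k)} \alpha_i^k\, (p_j^k)^2\, \frac{\partial f_i}{\partial p_j}(p^k), \qquad \sum_{i \in I(p^k, v^k)} \alpha_i^k = 1, \qquad j = 1, \ldots, n.$$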
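The iteration above can be sketched numerically. The following Python code is only an illustration under stated assumptions: it treats the case $m = 2$, and it assumes that the steepest descent direction is the negative of the minimum-norm convex combination of the two Riemannian gradients (the usual dual form of the direction subproblem), which admits a closed form for two objectives. The function names, the test objectives, the value $\beta = 0.1$ and the stopping tolerance are hypothetical choices, not taken from the paper.

```python
import numpy as np

def geodesic(p, v, t):
    """Geodesic (25) on the positive orthant: p_j(t) = p_j * exp((v_j / p_j) * t)."""
    return p * np.exp((v / p) * t)

def inner(p, u, w):
    """Riemannian inner product at p for the metric G(p) = diag(p_j^{-2})."""
    return float(np.sum(u * w / p**2))

def direction(p, egrads):
    """Direction for m = 2 objectives (assumed closed form, sketch only).

    egrads holds the Euclidean gradients f_1'(p), f_2'(p); the Riemannian
    gradients are G(p)^{-1} f_i'(p) = p**2 * f_i'(p).
    """
    g1, g2 = (p**2 * e for e in egrads)
    d = g1 - g2
    dd = inner(p, d, d)
    # alpha minimizes the Riemannian norm of alpha*g1 + (1 - alpha)*g2 over [0, 1]
    alpha = 0.5 if dd == 0.0 else float(np.clip(inner(p, g2, g2 - g1) / dd, 0.0, 1.0))
    return -(alpha * g1 + (1.0 - alpha) * g2)

def armijo(F, p, v, egrads, beta=0.1, max_halvings=60):
    """Largest t = 2^{-j} with F(p(t)) <= F(p) + beta * t * grad F(p) v, componentwise."""
    slopes = np.array([e @ v for e in egrads])  # <grad f_i(p), v>_p equals f_i'(p) . v
    Fp, t = F(p), 1.0
    for _ in range(max_halvings):
        if np.all(F(geodesic(p, v, t)) <= Fp + beta * t * slopes):
            break
        t *= 0.5
    return t

def steepest_descent(F, egrad_F, p0, tol=1e-8, max_iter=200):
    """Repeat p <- geodesic(p, v, t) until the direction is (numerically) zero."""
    p = np.asarray(p0, dtype=float)
    for _ in range(max_iter):
        egrads = egrad_F(p)
        v = direction(p, egrads)
        if np.sqrt(inner(p, v, v)) < tol:
            break
        p = geodesic(p, v, armijo(F, p, v, egrads))
    return p

# Hypothetical test problem: two objectives that are convex along the geodesics (25).
F = lambda p: np.array([np.sum(np.log(p)**2), np.sum((np.log(p) - 1.0)**2)])
egrad_F = lambda p: (2.0 * np.log(p) / p, 2.0 * (np.log(p) - 1.0) / p)

p0 = np.array([4.0, 0.5, 2.0])
p_star = steepest_descent(F, egrad_F, p0)
```

Because every update moves along the geodesic (25), the iterates stay in the positive orthant without any projection or explicit constraint handling.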



6.2 A steepest descent method for the hypercube 

Let $M$ be the hypercube $(0, 1) \times \cdots \times (0, 1)$ endowed with the Riemannian metric

$$M \ni p \mapsto G(p) = P^{-2}(I - P)^{-2} := \mathrm{diag}\left(p_1^{-2}(1 - p_1)^{-2}, \ldots, p_n^{-2}(1 - p_n)^{-2}\right)$$

(the metric induced by the Hessian of the barrier $b(p) = \sum_{i=1}^{n} (2p_i - 1)(\ln p_i - \ln(1 - p_i))$). The
Riemannian manifold $(M, G)$ is complete and the geodesic $p = p(t)$ satisfying $p(0) = p^0 = (p_1^0, \ldots, p_n^0)$
and $p'(0) = v^0 = (v_1^0, \ldots, v_n^0)$ is given by $p(t) = (p_1(t), \ldots, p_n(t))$, where

$$p_j(t) = \frac{1}{2}\left\{1 + \tanh\left(\frac{1}{2}\,\frac{v_j^0}{p_j^0(1 - p_j^0)}\,t + \frac{1}{2}\ln\frac{p_j^0}{1 - p_j^0}\right)\right\}, \qquad j = 1, \ldots, n, \tag{26}$$

and $\tanh(z) := (e^z - e^{-z})/(e^z + e^{-z})$. Moreover, $(M, G)$ has constant curvature equal to zero,
see Theorems 3.1 and 3.2 of [29]. In this case, from (26) and (24), there exist $\alpha_i^k \ge 0$ such that the
steepest descent iteration of the sequence generated by Method 1 is given by

$$p_j^{k+1} = \frac{1}{2}\left\{1 + \tanh\left(\frac{1}{2}\,\frac{v_j^k}{p_j^k(1 - p_j^k)}\,t_k + \frac{1}{2}\ln\frac{p_j^k}{1 - p_j^k}\right)\right\}, \qquad j = 1, \ldots, n,$$

with

$$v_j^k = -\sum_{i \in I(p^k, v^k)} \alpha_i^k\, (p_j^k)^2 (1 - p_j^k)^2\, \frac{\partial f_i}{\partial p_j}(p^k), \qquad \sum_{i \in I(p^k, v^k)} \alpha_i^k = 1, \qquad j = 1, \ldots, n.$$
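As a small numerical check of the geodesic (26), the following Python sketch evaluates it componentwise and forms the Riemannian gradient $G(p)^{-1} f'(p)$ for this metric. The function names and the sample point are hypothetical and serve only as an illustration; they are not part of the paper.

```python
import numpy as np

def hypercube_geodesic(p0, v0, t):
    """Geodesic (26) on (0, 1)^n with p(0) = p0 and p'(0) = v0, evaluated at time t."""
    p0 = np.asarray(p0, dtype=float)
    v0 = np.asarray(v0, dtype=float)
    speed = v0 / (p0 * (1.0 - p0))       # v_j / (p_j (1 - p_j))
    shift = np.log(p0 / (1.0 - p0))      # ln(p_j / (1 - p_j))
    return 0.5 * (1.0 + np.tanh(0.5 * speed * t + 0.5 * shift))

def hypercube_riem_grad(p, egrad):
    """Riemannian gradient G(p)^{-1} f'(p) for G(p) = diag(p_j^{-2} (1 - p_j)^{-2})."""
    p = np.asarray(p, dtype=float)
    return p**2 * (1.0 - p)**2 * np.asarray(egrad, dtype=float)

# Hypothetical sample point and tangent vector.
p0 = np.array([0.2, 0.5, 0.9])
v0 = np.array([0.3, -0.1, 0.05])
p_half = hypercube_geodesic(p0, v0, 0.5)
```

In exact arithmetic the curve never leaves the open hypercube; in floating point, `tanh` saturates for very large arguments, so a practical implementation may want to guard against values that round to 0 or 1.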



6.3 A steepest descent method for the cone of positive semidefinite matrices

Let $\mathbb{S}^n$ be the set of the symmetric $n \times n$ matrices, $\mathbb{S}^n_{+}$ the cone of the symmetric positive
semidefinite matrices and $\mathbb{S}^n_{++}$ the cone of the symmetric positive definite matrices. Following Rothaus
[32], let $M = \mathbb{S}^n_{++}$ be endowed with the Riemannian metric induced by the Euclidean Hessian of
$\Psi(X) = -\ln\det X$, i.e., $G(X) := \Psi''(X)$. In this case, the unique geodesic segment connecting any
$X, Y \in M$ is given by

$$X(t) = X^{1/2}\left(X^{-1/2}\, Y\, X^{-1/2}\right)^{t} X^{1/2}, \qquad t \in [0, 1],$$

see [27]. More precisely, $M$ is a Hadamard manifold (with curvature not identically zero), see
for example [20], Theorem 1.2, page 325. In particular, the unique geodesic $X = X(t)$ such that
$X(0) = X$ and $X'(0) = V$ is given by

$$X(t) = X^{1/2}\, e^{\,t X^{-1/2} V X^{-1/2}}\, X^{1/2}. \tag{27}$$



Thus, from (27) and (24), there exist $\alpha_i^k \ge 0$ such that the steepest descent iteration of the sequence
generated by Method 1 is given by

$$X^{k+1} = (X^k)^{1/2}\, e^{\,t_k (X^k)^{-1/2} V^k (X^k)^{-1/2}}\, (X^k)^{1/2},$$

with

$$V^k = -\sum_{i \in I(X^k, V^k)} \alpha_i^k\, X^k f_i'(X^k)\, X^k, \qquad \sum_{i \in I(X^k, V^k)} \alpha_i^k = 1.$$
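The geodesic (27) and the gradient term $X f_i'(X) X$ can be evaluated with symmetric eigendecompositions. The Python sketch below is a minimal illustration; the helper names (`sym_fun`, `spd_geodesic`, `spd_riem_grad`) and the sample matrices are hypothetical, and the explicit symmetrization steps are an implementation choice to limit rounding drift, not something prescribed by the paper.

```python
import numpy as np

def sym_fun(A, f):
    """Apply a scalar function f to a symmetric matrix A via its eigendecomposition."""
    w, Q = np.linalg.eigh(A)
    return (Q * f(w)) @ Q.T   # scales column j of Q by f(w_j), then multiplies by Q^T

def spd_geodesic(X, V, t):
    """Geodesic (27): X(t) = X^{1/2} exp(t X^{-1/2} V X^{-1/2}) X^{1/2}."""
    X_half = sym_fun(X, np.sqrt)
    X_inv_half = sym_fun(X, lambda w: 1.0 / np.sqrt(w))
    S = X_inv_half @ V @ X_inv_half
    S = 0.5 * (S + S.T)                      # remove rounding asymmetry
    Y = X_half @ sym_fun(t * S, np.exp) @ X_half
    return 0.5 * (Y + Y.T)

def spd_riem_grad(X, egrad):
    """Riemannian gradient X f'(X) X for the metric induced by the Hessian of -ln det X."""
    return X @ egrad @ X

# Hypothetical sample data: a positive definite X and a symmetric tangent vector V.
X = np.array([[2.0, 0.5], [0.5, 1.0]])
V = np.array([[0.3, -0.2], [-0.2, 0.1]])
X_next = spd_geodesic(X, V, 1.5)
```

Since the matrix exponential of a symmetric matrix is positive definite, each point of the curve remains in $\mathbb{S}^n_{++}$ for every real $t$, so a step of any length along (27) is feasible.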



Remark 24. Under the assumption of convexity of the vector function $F$, if $(M, G)$ is the Riemannian
manifold in the first or in the second example, then Corollary 23 assures the full convergence
of the sequence generated by Method 1. This does not necessarily happen if $(M, G)$ is the Riemannian
manifold in the last example, since in this case $(M, G)$ has non-positive curvature, i.e.,
$K \le 0$. However, Theorem 10 assures at least partial convergence.

7 Final remarks



We have extended the steepest descent method with Armijo's rule for multicriteria optimization to
the Riemannian context. Full convergence is obtained under the assumptions of quasi-convexity
of the multicriteria function and non-negative curvature of the Riemannian manifold. An open
problem is to obtain the same result without a restrictive assumption on the curvature of the manifold.
Following the same line as this paper, as future work we propose the extension, to the Riemannian
context, of the proximal method (see Bonnel et al. [5]) and of the Newton method (see Fliege et al.
[13]), both for multiobjective optimization.